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Abstract 

We introduce an atomic formula y _Lj z intuitively saying that the variables 
y are independent from the variables z if the variables x are kept constant. We 
contrast this with dependence logic T> [7] based on the atomic formula — (x,y), 
actually equivalent to y J-g y, saying that the variables y are totally determined 
00 ' by the variables x. We show that y z gives rise to a natural logic capable of 

<^^> formalizing basic intuitions about independence and dependence. We show that 

y ± 3 z can be used to give partially ordered quantifiers and IF-logic an alternative 
interpretation without some of the shortcomings related to so called signaling that 

■ interpretations using —(x, y) have. 
• i-H ■ 

■ Of the numerous uses of the word "dependence" we focus on the concept of an at- 
$H \ tribute 1 depending on a number of other similar attributes when we observe the world. 



We call these attributes variables. We follow the approach of [7] and focus on the 
strongest form of dependence, namely functional dependence. This is the kind of de- 
pendence in which some given variables absolutely deterministically determine some 
variables, as surely as x and y determine x + y and x ■ y in elementary arithmetic. The 
idea is that weaker forms of dependence can be understood in terms of the strongest. 
Functional dependence of x on y is denoted in [7] by the symbol =(y, x). If we adopt 
the shorthand 

={y, x) for =(y, xi) A ... A ={y, x„) 
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we get a more general functional dependence. Although there are many different intu- 
itive meanings for =(y, x), such as "y totally determines x" or "x is a function of y", 
the best way to understand the concept is to give it semantics: 

Definition 1 ([5, 7]). Sets of assigments are called teams. A team X satisfies =(y, x) 
in M, in symbols M \= x ={y, x), or just X |= —(y, x), if 

Vs, a' G X(s(y) = s'(y) -»■ s(x) = s'(x)). (1) 

Condition (1) is a universal statement. As a consequence it is closed downward, 
that is, if a team satisfies it, every subteam does. In particular, the empty team satisfies 
it for trivial reasons. Also, every singleton team {s} satisfies it, again for trivial reasons. 

Functional dependence has been studied in database theory and some basic prop- 
erties, called Armstrong's Axioms have been isolated [1]. These axioms state the 
following properties of =(y, x): 

1. =(x, x). Anything is functionally dependent of itself. 

2. If = (y, x) and y C z, then = (z, x). Functional dependence is preserved by 
increasing input data. 

3. If j? is a permutation of z, u is a permutation of x, and = (z, x), then = (y, u). 
Functional dependence does not look at the order of the variables. 

4. If =(y, z) and =(z, x), then =(y, x). Functional dependences can be transitively 
composed. 

These rules completely describe the behavior of =(y, x) in the following sense: If 
T is a finite set of dependence atoms of the form = (y, x) for various x and y, then 
= (y, x) follows from T according to the above rules if and only if every team that 
satisfies T also satisfies =(y, x). Let us see how Armstrong [1] proved this: Suppose 
T \= =(y, x), i.e. every team satisfying T satisfies =(y, x). Let z 3 ybe the list of 
variables z such that ={y, z) can be derived using the rules (l)-(4) from T. Let x G x. 
We show x G z. Suppose not. Let X be the team {s, s'}, where s(z) = s'(z) = 
for all z G z, but s(u) = 0, s'(u) = 1 for all u ^ z. Note that X y= ={y, x) because 
x £ z. So it suffices to show X |= T. Suppose = (u, v) G T. Let v G v. We show 
X |= =(u, v). If Sfl — z ^ 0, then s(u) ^ s'(u). So w.l.o.g. u C z. If v is not x, 
then s(v) = s'(v). So w.l.o.g. v is the variable x. But now transitivity gives x G z, 
contradiction. QED 

We shall now give the concept of independence a similar treatment as we gave 
above to the concept of dependence. Again we start from the strongest conceivable 
form of independence of variables x and y, a kind of total lack of connection between 
them, which we denote x±y. We can read this in many ways: 

• x and y are completely independent from each other. 

• x and y occur totally freely. 

• x and y give absolutely no information of each other. 
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• Every conceivable pattern occurs for x and y. 

Suppose balls of different sizes and masses are dropped from the Leaning Tower of 
Pisa in order to observe how the size and mass influence the time of descent. One may 
want to make sure that in this test: 

The size of the ball is independent of the mass of the ball. (2) 

How to make sure of this? Ideally one would vary the sizes and the masses freely so 
that if one mass is chosen for one size it would be also be chosen for all the other sizes, 
and if one size is chosen for one mass it is also chosen for all other masses. This would 
eliminate any dependence between size and mass and the test would genuinely tell us 
something about the time of descent itself. We would then say that the size and the 
mass were made independent of each other in the strongest sense of the word. 
Suppose we have data about tossing two coins and we want to state: 

Whether one coin comes heads up is independent 
of whether the other coin comes heads up. 

To be convinced, one should look at the data and point out that all four possibilities 
occur. Probability theory has its own concept of independence which however is in 
harmony with ours, only we do not pay attention to how many times certain pattern 
occurs. In probability theory, roughly speaking, two random variables are independent 
if observing one does not affect the (conditional) probability of the other. We could 
say the same without paying attention to probabilities as follows: two variables are 
independent if observing one does not restrict in any way what the value of the other 
is. 

If we look at any demographic data except for rather small data we may observe: 

A person's gender is independent 

of whether the person speaks Spanish. 

We would use a given data as support of the truth of this by finding in the data a 
male and a female that speak Spanish, and a male and a female that do not speak 
Spanish. Once this is established it would be rather difficult to claim that there is some 
dependence in the given data between the gender and the ability to speak Spanish. 
Of course this analysis again ignores the probabilities. That is, the data may have 
many females who speak Spanish but only one male that speaks Spanish and still our 
requirement for independence would be satisfied. This just shows that our criterion is 
really of a logical kind, not of a probabilistic kind. 

When Galileo dropped balls of the same size from the Leaning Tower of Pisa he 
was able to observe: 

Their time of descent is independent of their mass. (5) 

What did this mean? It means that each ball has the same time of descent, a constant, 
and therefore Galileo could conclude that it is independent of the mass. Being constant 
is a kind of strong independence different from the above examples where we empha- 
sized that in a sense all possible patterns should occur. When Galileo dropped balls 
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from the tower he did not observe all possible patterns and still he was able to conclude 
a certain independence. 

Einstein stated in his theory of special relativity that: 



The speed of light is independent of the observer's state of motion. 



(6) 



This is another famous example of independence where one of the variables is constant. 
Of course the constancy of the speed of light was not considered a scientific fact at the 
time, although observations supported it. 

So we should accept that one form of total independence is when one of the vari- 
ables is a constant. 

Another feature of the strongest possible independence is symmetry. In our ex- 
ample (2)-(6) there is a total symmetry of the variables. There are weaker forms of 
independence where symmetry is not present. For example: The result of collecting 
data about trading might support the claim: 



However, there would be no reason to believe that as a consequence: 

Non-public information about the company is independent of 
this investor's trading. 

Let us now introduce the semantics of x _L y: 

Definition 2. A team X satisfies the atomic formula x _L y if 



What this definition says is the following criterion for a team X of "data" to man- 
ifest the independence of x and y: Knowing s(x) alone for a given s G X gives no 
information about s(y), because there may be s' G X with s'(y) ^ s(y), and then (9) 
gives s" G X with s"(x) = s(x) and s"(y) — s'(y). So just when we were going to 
say that s(x) is enough evidence to conclude that the value of y is s(y), we see this 
other s" with the same value for x but a different value for y. 

We can immediately observe that a constant variable is independent of every other 
variable, including itself. To see this, suppose x is constant in X. Let y be any variable, 
possibly y = x. If s, s' G X are given, we need s" G X such that s" (x) = s(x) and 
s"(y) = s'(y). We can simply take s" = s'. Now s"(x) = s(x), because x is constant 
in X. Of course, s"(y) = s'(y). Conversely, if x is independent of every other variable, 
it is clearly constant, for it would have to be independent of itself, too. So we have 



We can also immediately observe the symmetry of independence, because the cri- 
terion (9) is symmetrical in x and y. More exactly, s" (y) = s{y) A s"{x) = s'(x) and 
s"(x) = s'(x) A s"(y) = s(y) are trivially equivalent. 

Our observations on constancy and symmetry lead to the following definition: 



This investor's trading is independent of non-public 
information about the company. 



(7) 



Vs, s' G X3s" G X(s"{y) = s{y) A s"(x) = s'(x)). 



(9) 
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Definition 3. The following rules are called the Independence Axioms 

1. Ifx _L y, then y _L x (Symmetry Rule). 

2. Ifx _L x, then y _L x ( Constancy Rule). 

It may seem that independence must have much more content than what these two 
axioms express, but they are actually complete in the following sense: 

Theorem 4 (Completeness of the Independence Axioms). If T is a finite set of de- 
pendence atoms of the form u _L v for various u and v, then i/li follows from T 
according to the above rules if and only if every team that satisfies T also satisfies 

y -J- x. 

Proof. Let us see how this follows: Suppose T \= j/lx. We try to derive y _L x from 
T. If y _L x G T or x _L y G T, we are done by the Symmetry Rule. So we assume 
y _L x T and x ± y ^ T. Let V be the set of variables z such that z _L z G T. If 
.t G V" or y G V, we are done by the Constancy Rule, so we assume V D {x, y} = 0. 
Consider a domain consisting of V and two new elements and 1. For d G {0, 1} let 
Xd consist of all s such that ifv£V, then s(v) = v, and moreover s(y) — s(x) = d. 
Finally, let X = XqUXl 

Let us first observe that X y= x _L y, because there are s G X with s(x) =0 and 
s' e X with s'(j/) = 1 but there is no s" G X such that both s"(x) = and s"(j/) = 1. 

Let us then show that X \= T. Suppose u _L v G T. If u G F or v G V, then 
clearly X |= u _L v. So we may assume {u, v} n V = 0. In particular, u ^ v. Suppose 
u = x, u = y, v = x ovv = y. By symmetry we may assume u = x. We may then 
also assume v ^ {x, y} for otherwise we are done. Let now s, s' G X be arbitrary. Let 
s"(u) — s(u), s"(v) = s'(v) and s"(w) — w for w G V. Then s" G A. So we have 
proved A |= w _L v in this case. The final case is that {u, v} n {x, y} = 0. In this case 
it is trivial that A |= u _L v. 

□ 

The independence atom y -L x turns out to be a special case of the more general 
notion 

y -Ls z 

the intuitive meaning of which is that the variable y are totally independent of the 
variables z when the variables x are kept fixed. 

Suppose objects of different forms (balls, pins, etc), different sizes and different 
masses are dropped from the Leaning Tower of Pisa in order to observe how the form, 
size and mass influence the time of descent. One may want to make sure that in this 
test: 

For a fixed form, the size of the object is 
independent of the mass of the object. 

How to make sure of this? Ideally one would vary for each form separately the sizes 
and the masses freely so that if one mass is chosen in that form for one size it would 
be also be chosen in that form for all the other sizes, and so on. We would then say 
that the size and the mass were made independent of each other, given the form, in the 
strongest sense of the word. 
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We now give exact mathematical content to y _Lj z: 

Definition 5. A team X satisfies the atomic formula y z if for all s,s' £ X such 
that s(x) — s'(x) there exists s" £ X such that s" (x) = s(x), s"(y) — s(y), and 
s"{z) = *'(*)). 

In the case of the sentence (10) this means the following: A set of observation 
concerning the falling objects is said to satisfy the requirement (10) if for any two tests 
s and s' where the form of the objects was the same there is a test s" still with the same 
form but which picks the size from test s and the weight from test s'. Note that this is 
in harmony with there having been just one test, but of course no scientific experiment 
would be satisfactory with just one test. So when there are several tests the requirement 
of (10) being satisfied actually pushes the number of tests up. 

Here are some elementary properties of y ±g z: 

Lemma 6. =(x, y) logically implies y ±g z. 

Proof. Suppose X satisfies = y). To prove that X satisfies y ±g z, let s, s' £ X 
such that s(x) = s'(x). Note that s(y) = s'(y). We can choose s" = s', for then 

s" e X, s"{x) = s(x), s"{y) = s(y), and s"{z) = s'(z). □ 

Lemma 7. y .Lg z logically implies —(x, y n z). 

Proof. Suppose X satisfies y ±g z. To prove that X satisfies =(x, y n z), let s, s' 6 X 
such that s(x) = s'(x). We show s(y n z) — s'(y n z). Let us choose s" G X 
such that s"{x) = s(x), s"{y) — s(y), and s"{z) — s'(z). Let w £ y n z . Then 
s(w) = s"(w) = s'(w). □ 

Corollary 8. ={x,y) y L 3 y 

So dependence is just a special case of independence, when independence is defined 
in the more general form. This has the pleasant consequence that when we define 
independence logic X by adding the atomic formulas y ±g z to first order logic, we 
automatically include all of dependence logic. 

We get the following reformulation of the corollary: 

Corollary 9. y y ' y _Lj z (Constancy Rule) 

Here are some rather trivial properties 
Lemma 10. 1. x ±g y (Reflexivity Rule) 

2. z y => y ±g z ( Symmetry Rule) 

3. yy' ±g zz' y _Lj z. (Weakening Rule) 

4. If z' is a permutation of z, x' is a permutation ofx, y' is a permutation ofy, then 
y z ^> y' z'. (Permutation Rule) 

A little less trivial are the following properties: 

Lemma 11. 1. z ±s y => yx ±g zx (Fixed Parameter Rule) 
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2. x -Lgy Au -Lgs y ^ u Lg y. (First Transitivity Rule) 

3. y ±z y A zx -Lg u =>■ x u (Second Transitivity Rule 1 ) 
Note that the Second Transitivity Rule gives by letting u = x: 

y y A x ±y x => x ±g x, 

which is the transitivity axiom of functional dependence. In fact Armstrong's Axioms 
are all derivable from the above rules. It remains open whether our rules permit a 
completeness theorem like Armstrong's Axioms do 3 . 

Now we can define a new logic by adding the independence atoms y ±g z to first 
order logic just as dependence logic V was defined in [7]: 

Definition 12. We define independence logic X as the extension of first order logic by 
the new atomic formulas 

for all sequences y, x, z of variables. The negation sign —i is allowed in front of atomic 
formulas. The other logical operations are A, V, 3 and V. The semantics is defined for 
the new atomic formulas as in Definition 5 and in other cases exactly as for dependence 
logic in [7]. The negation of y _Lj z is satsified by the empty team alone. 

There is an obvious alternative game-theoretic semantics based on the idea that a 
winning strategy should allow "mixing" of plays in the same way Definition 5 mixes 
assignments s and s' into a new one s". As we see below, this means that the existential 
player cannot use her own moves to code signals to herself and thereby go around 
requirements of imperfect information. 

Let us recall the following characterization of dependence logic 4 : 

Theorem 13 ([6]). The expressive power of formulas <j)(x\, ...,x n ) of dependence logic 
is exactly that of existential second order sentences with the predicate for the team 
negative. More exactly, let us fix a vocabulary L and an n-ary predicate symbol S £ L. 
Then: 

• For every L-formula (j)(x\, ...,x n ) of dependence logic there is an existential 
second order L U {S}-sentence $(5), with S negative only, such that for all 
L-structures M and all teams X: 

M\=x<Kx u ...,x n ) M\=$(X). (11) 

• For every existential second order L U {S}-sentence & {S), with S negative only, 
there exists an L-formula <j)(x\ , x n ) of dependence logic such that (11) holds 
for all L-structures M and all teams X ^ 0. 

2 We are indebted to P. Constantinou and P. Dawid for pointing out an error in our original formulation. 

3 We are grateful to F. Engstrom for pointing out that this problem is already solved in a slightly different 
formulation in database theory [2]. 

4 A team X can be thought of as a relation rcl(X) by identifying an assignment s with domain 
{xi, ...,x n } with the n-tuple (s(xi), ...,s(x„)). With this in mind, we use the notation M \= 
for(M,rel(X)) |= $(5). 
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The question arises, whether there is a similar characterization for independence 
logic. We do not know the answer. However, we may note the following: 

Proposition 14. The expressive power of formulas 4>{x\, x n ) of independence logic 
is contained in that of existential second order sentences with a predicate S for the 
team. More exactly, let us fix a vocabulary L and an n-ary predicate symbol S £ L. 
Then for every L-formula (f>(x\, ...,x n ) of independence logic there is an existential 
second order L U {S}-sentence T,p(S) such that for all L-structures M and all teams 
X:M\=xd>(x u ...,x n ) <=► Mh#). 

Proof. The construction of T^ Xlt ___ tXn )(S) is done by induction on (j>(x\ x n ). The 
construction is done exactly as in [7, Theorem 6.2] except for the independence atoms. 
Consider an independence atom 

Xi-^-xj Xk • 

In this case r^( Xlj ... iXn )(S') would be: 

Vj/i . . . y„Vzi . . . z n ((S(y) A S(z) A Vj = z 3 ) -> 

3ui . . . 3u n (S(u) A Uj = yj A u { = y l A u k = z k )). 

It is obvious how to generalize this to independence atoms among tuples (xi)i e j, 

(xj)j£j and (xk)keK for any index sets /, J, K C {1, . . . , n). □ 

For sentences tpofXwe define as for T>: M \= <j> M 4>- 

Corollary 15. For sentences independence logic and dependence logic are equivalent 
in expressive power. 

Proof. Suppose is a sentence of independence logic. There is an existential second 
order sentence (S) such that for every model M we have 

M |= t*({0}). 

By Theorem 13, there is a sentence tp of dependence logic such that for every model 
M we have 

M \= r ({0}) <=> M h 4>- 
Thus the sentences <fi and tp are equivalent. □ 

Note that formulas of independence logic need not be closed downward (i.e. M 
<p and Y C X need not imply M \=y <P), for example x _L y is not. This is a big differ- 
ence to dependence logic. Still, the empty team satisfies every independence formula. 

The sentence 

\/x\fy3z(z _L x A z = y) 

is valid in harmony with the intuition that the existential player should be able to make 
a decision to be independent of x when she chooses z whether she lets z = y or not. 
The sentence 

Vx3j/3z(z _L x A z = x) 
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is not valid in harmony with the intuition that the existential player needs to follow 
what the universal player is doing with his x in order to be able to hit z = x. In 
independence friendly logic ([4]) the sentence 



\/x3y3z/x(z = x), 



is valid which is often found counter-intuitive. The trick (called "signaling") is that 
the existential player stores the value of x into y and then chooses z on the basis of 
y, apparently not needing to know what ir is. In fact one might consider the entire 
independence friendly logic with the following interpretation: 



as an alternative to the usual one based on =(z, x). As we have seen above the inter- 
pretation (12) is not necessarily entirely faithful. However, the atom y L% x has one 
clearly distinguishable meaning of independence of y from x so it might be interesting 
to look at independence friendly logic with this interpretation. 

Our independence atom works well also in giving partially ordered quantifiers com- 
positional semantics as the following lemma illustrates: 

Lemma 16. Suppose <fi(x, y, u, v, z) is a first order formula. Then the following con- 
ditions are equivalent: 



(2) M \={ s \ yx3y\/u3v(v x A <j>(x, y, u, v, z)) 

The implication from (1) to (2) is obvious. The converse is a consequence of the 
fact that the implication 



is valid in all teams. Suppose not. Then there is a team X satisfing the left side, with 
two assigments s,s' e X such that s(u) = s'(u) but s(v) ^ s'(v). Since x and u 
determine v it then follows that s(x) ^ s'(x). Using independence, we infer that there 
exists s" G X with s"(u) = s(u) = s'(u), s"(x) = s(x), and s"(v) — s'(v) ^ s(v) 
contradicting the assumption that x and u determine v. 

Remark. In item (2) we might use instead of v _L Uj? x also the stronger condition 
uv On the other side the weaker condition v ±gx does not suffice. At first sight, 

implications of the form (~(x, u, v) A v±x) =>■ = (u, v) (saying that if a variable is 
completely determined by x and u and independent from x, then it is already deter- 
mined by u) might seem plausible. However, it is not valid in general, and it is quite 
easy to construct a counterexample over a domain of three values. 

An interesting question raised by the discussion above concerns the power of in- 
dependence logic for expressing properties of teams. Given that independence logic 
can be embedded into existential second-order logic, and that, on finite structures, ex- 
istential second-order logic captures the complexity class NP, we may formulate this 
for finite structures as follows: 



[3x/ycf)(x, y, z}]* = 3x(y x A [</>(x, y, z)}*) 



(12) 




(~(x,u,v) AvL u x) => =(u,v) 
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Problem: Characterize the NP properties of teams that correspond to formulas of in- 
dependence logic. 

Note that the corresponding question for dependence logic is solved by Theorem 13: 
Dependence logic can define exactly those NP properties of teams that are expressible 
in existential second-order logic with the predicate for the team occurring only nega- 
tively. Very recently, Pietro Galliani [3] has also solved the problem for independence 
logic, showing that actually all NP-properties of teams can be defined in independence 
logic. 
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